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We apply in this article (non rigorous) statistical mechanics methods to the problem of counting 
long circuits in graphs. The outcomes of this approach have two complementary flavours. On the 
algorithmic side, we propose an approximate counting procedure, valid in principle for a large class 
of graphs. On a more theoretical side, we study the typical number of long circuits in random graph 
ensembles, reproducing rigorously known results and stating new conjectures. 



I. INTRODUCTION 

Random graphs [3, Q| appeared in the mathematical hterature as a convenient tool to prove the existence of graphs 
with a certain property: instead of a direct constructive proof exhibiting such a graph, one can construct a random 
ensemble of graphs and show that this property is true with a positive probability. Soon afterwards the study of 
random graphs acquired interest on its own and led to many beautiful mathematical results. A large class of problems 
in this field can be formulated in the following generic way: a graph H being given, what is the probability that a graph 
G extracted from the random ensemble under consideration contains as a subgraph? With a more quantitative 
ambition, one can define Mh {G) as the random variable counting the number of occurrences of distinct copies of H 
in G, and study its distribution. These problems are relatively simple when the pattern H remains of a finite size in 
the thermodynamic limit, i.e. when the size of the random graph G diverges. The situation can become much more 
involved when H and G have large sizes of the same order, as Mh{G) can grow exponentially with the system size. 

In this article we shall consider these questions when the looked for subgraph _ff is a long circuit (also called loop 
or cycle), i.e. a closed self-avoiding path visiting a finite fraction of the vertices of the graph. The level of accuracy of 
the rigorous results on this problem depends strongly on the random graph ensemble The regular case (when 

all vertices of the graph have the same degree c) is the best understood one. It has for instance been shown that 
c-regular random graphs with c > 3 have with high probability Hamiltonian circuits (circuits which visit all vertices 
of the graph) and the distribution of their numbers is known 0, |^ . This study has been generalized to circuits of 
all length in Less is known for the classical Erdos-Renyi ensembles, where the degree distribution of the vertices 
converges to a Poisson law of mean c. Most results concerns either the neighborhood of the percolation transition 
at c = 1 H , 0, , or the opposite limit of very large mean connectivity, either finite with respect to the size of the 
graph |lCl|| or diverging like its logarithm (it is in this latter regime that the graphs become Hamiltonian). We 
shall repeatedly come back in the following on this discrepancy between regular random graphs where probabilistic 
methods have been proved so successful and the other ensembles for which they do not seem powerful enough and 
might be profitably complemented by approaches inspired by statistical mechanics. We will discuss in particular a 
conjecture formulated by Wormald yf, according to which random graph ensembles with a minimal degree of 3 (and 
bounded maximal degree) should be Hamiltonian with high probability. 

Besides this probabilistic point of view (what are the characteristics of the random variable associated to the 
number of circuits), the problem has also an algorithmic side: how to count the number of circuits in a given graph? 
Exhaustive enumeration, even using smart algorithms Il2|, is restricted to small graphs as the number of circuits 
grows exponentially with the size. More formally, the decision problem of knowing if a graph is Hamiltonian (i.e. that 
it contains a circuit visiting all vertices) is NP-complete [T^ . A probabilistic algorithm for the approximate counting 
of Hamiltonian cycles is known |l4j , but is restricted to graphs with large minimal connectivity. 

Random graphs have also been largely considered in the physics literature, mainly in the real-world networks 
perspective (15j, i.e. in order to compare the characteristics of observed networks, of the Internet for instance, with 
those of proposed random models. Empirical measures for short loops in real world graphs were for instance presented 
in Long circuits visiting a finite fraction of the vertices were also studied in [l^. The behavior of cycles in the 
neighborhood of the percolation transition was considered in [l^ . and the average number of circuits for arbitrary 
connectivity distribution was computed in |l9j . 

In this paper we shall turn the counting problem into a statistical mechanics model, which we treat within the 
Bethe approximation. This will led us to an approximate counting algorithm, cf. Sec.^ We will then concentrate on 
random graph ensembles and compute the typical number of circuits with the cavity replica-symmetric method po| 
in Sec, mil The next two sections will be devoted to the study of the limits of short and longest circuits, then we shall 
investigate the validity of the replica-symmetry assumption in Sec. IVII We perform a comparison with exhaustive 
enumerations on small graphs in Sec. IVIII and draw our conclusions in Sec. IVIIII Three appendices collect more 



2 



technical computations. A short account of our results has been published in plj. 

II. A STATISTICAL MECHANICS MODEL AND ITS BETHE APPROXIMATION 

A. Derivation of the BP equations 

Let us consider a graph G on N vertices (also called sites in the following) i — 1, . . . ,N, with M edges (or links) 
1 = 1,..., M . The notation I = (i j) shall mean that the edge I joins the vertices i and j. The degree, or connectivity, 
of a site is the number of links it belongs to. The graphs are assumed in the main part of the text to be simple, i.e. 
without edges from one vertex to itself or multiple edges between two vertices. We denote di the set of neighbors of 
the vertex i, and use the symbol \ to subtract an element of a set: if j is a neighbor of i, di\j will be the set of all 
neighbors of i distinct of j. The same symbol di will be used for the set of edges incident to the vertex i, the context 
will always clarify which of the two meanings is understood. 

A circuit of length L is an ordered set of L different vertices, (ii, . . . , ii), such that (inin+i) is an edge of the graph 
for all n G [1,L — 1], as well as (ilh)- Two circuits are distinct if they do not share the same set of edges (i.e. the 
starting point and the orientation of a tour along the vertices is not relevant), and we denote Ml{G) the number of 
distinct circuits of length L in a graph G. 

The degrees of freedom of our model are M variables Si £ {0, 1} placed on the edges of the graph, with their global 
configuration called S_ = {5*1, . . . , Sm}- We shall also use = {Si\l £ di} for the configuration of the variables on 
the links around the vertex i. We introduce the following probability law on the space of configurations: 

where Z[u) is the normalization constant, and the weights wi, Wi are given by 

fi if E^/e{0,2} 

Wl{Sl) = u^' , w^{S^, = I . (2) 

I otherwise 

By convention Wi = 1 ii di = ^, that is to say if the vertex i is isolated. The relevance of this model for the counting of 
circuits is unveiled by the following reasoning. Each configuration S_ can be associated to a subgraph of G, retaining 
only the edges I such that Si = 1. The probability (with respect to the law O) of such a subgraph is non zero only 
if the retained edges form closed circuits (any site i is constrained by Wi to be surrounded by either or 2 edges 
of the subgraph), and in that case it is proportional to with L the number of its edges. This implies that the 
normalization factor Z{u) is the generating function of the numbers A/l(G'), 

Ziu) = Y,^''^L{G) . (3) 

L 

A precision should be made at this point: wc defined above a circuit as a self-avoiding closed path. From the weights 
on the configurations defined by Eqs. H1I2|I . Ml{G) counts in fact the number of configurations made of possibly 
several vertex disjoint circuits, of total length L. In the following we shall concentrate on the limit of large graph and 
of long circuits, and we expect the leading order behaviour of Nl not to be affected by this subtlety (see App. _ 
for a combinatorial argument in favour of this thesis) , that will be kept understood from now on-'^ . Note also that [25 
proposed a Monte Carlo Markov Chain algorithm for the evaluation of such a partition function. 

We are thus performing a canonical computation where the length of the circuits is allowed to fluctuate around 
a mean value fixed by the conjugate external parameter u. In the thermodynamic limit N oo the saddle-point 



^ The reader might think this problem would be solved by enlarging the space of the configurations Si to a Potts-like spin, Si S {0, 1, . . . ,q}, 
with the weight Wi enforcing that either all variables around i are vanishing, or two are non zero and of the same colour 1, . . . ,q. In the 
bivariate generating function Z{u, q) q is then conjugated to the number of disconnected circuits, and the limit q — > should allow to 
eliminate configurations made of several disconnected circuits. However, the Bethe approximation of this model is pathological, and we 
shall not pursue this road here. 



3 



method can be used to evaluate the sum Defining f{u) = In Z{u) and cr(£) = lnA/L=£jv, where £ = L/N is 
a reduced intensive length, one obtains: 

f{u) = max[£ In u + a{£)] . (4) 

In this limit the fluctuations of the intensive circuit length in the canonical ensemble vanishes, £ is concentrated 
around its mean value £{u) = uf'{u). The (concave hull of the) microcanonical entropy can thus be obtained from 
the canonical free-energy (with a slight abuse of terminology we shall use this denomination for f{u)) by an inverse 
Legendre transform, 

a{£) = min[f{u)^£\nu] . (5) 

u 

We shall now use the Bethe approximation to obtain an estimation of the generating function Z{u). We sketch first 
the general strategy to derive Bethe approximations of statistical models (see for a comprehensive discussion), 
before applying it to the present case. 

Consider a (non- negative) weight function u'(S^) defined on a space of configurations {S}. The computation of the 
partition function, Z = X]s^(-^)' reformulated as an extrcmization problem. Indeed, the Gibbs functional 

free-energy, 

FGibbsH=E^'v(^)ln(^) , (6) 

is minimal (in the space of normalized variational distributions) for Pv{S_) = p(S_) = 'w{S_)/Z, where it takes the value 
(— In Z). In general finding the minimum of this functional is not simpler than a direct computation of Z , however this 
formulation opens the way to variational approaches: the minimum of Fcibbs in a restricted set of trial distributions 
Pv, more easily parametrized than generic ones, yields an upper bound on (— InZ). The simplest implementation of 
this idea is the mean-field approximation, in which the trial distributions are factorized, PviS_) = YiiPii'^O- ^ natural 
refinement consists in introducing correlations between neighboring variables in the trial distributions. Consider for 
instance a weight function of the form for arbitrary and Wi. One can easily show that if the underlying graph 
G were a tree, the true probability distribution would be given by 

P{S) = (^piiSi)^ (^P.{S,)^ , (7) 

where pi and pi are the exact marginals (for instance, pi{Si) = J2s\Si Pi^)) '^^ P- When the graph is not a tree, 

this expression is not valid any more. The Bethe approximation consists however in assuming that trial probability 
distributions can be approximately written under this form even if the graph contains cycles. This yields the so-called 
Bethe free-energy, 

i^Bcthe[te}, {Pl}] - E 1^ (^^) -EE iPliSl)MSl)) ■ (8) 

^=l S, 

This free-energy is to be minimized with respects to the approximate marginals p;, pi, which have to respect two 
types of constraints: 

• Pl and Pi are normalized. 

• they are consistent, i.e. for each link I = (ij), one has 

PliSl) = E P^iS..) = E ■ (9) 

Si\S, s.\s, 



This constrained minimization can be performed considering the {pi}, {pi} as independent, at the price of the intro- 
duction of Lagrange multipliers to enforce the conditions @ . It is well known that such a procedure amounts to look 
for a fixed point of the corresponding belief propagation (BP) equations |2^. In this setting the Lagrange multipliers 
are interpreted as messages sent by variables to neighboring constraints, and vice-versa. 

Let us now apply this formalism to the specific weights defined in Eq. A peculiarity of Wi has to be kept in 
mind: it can be strictly vanishing when the geometrical constraint of having or 2 present edges around each vertex 



Vk- 




FIG. 1: The messages involved in Eq. (1121 . 



is not fulfilled. As a consequence, the approximate variational marginals Pi{S_^) have to respect this constraint, and 
vanish when 'Wi{S_^) — 0. The Bethe free-energy reads now 



N M 

FBethe[fe},ta}] = ^^p.(^,)ln(p.(5J) -^^pz(501n(pi(^0«^0 

i=l S. 1 = 1 Si 



(10) 



where the convention OlnO = has been used, for the strictly forbidden configurations with Wi{S_^) — not to 
contribute to Feetho- 

A possible parametrization of the marginals achieving the extremum of the Bethe free-energy is 



S(,j) 



(11) 



where the Ci and Ci are normalization constants, and for each link (ij) of the graph a pair of directed (real positive) 
messages has been introduced, yt^j and yj~>i- These messages obey the following BP equations. 



2^ yk~*i Uk'-^i 

k,k' £di\j 



(12) 



cf. Fig. n for 8- graphical representation. Roughly speaking, yi^j is proportional to the probability that the edge 
(ij) would be present if the constraint Wj and the weight u^' were to be discarded. Hence the form of Eq. H12|l : the 
numerator corresponds to the situation where (ij) is present, the constraint Wi imposes then that exactly one of the 
edges of di \ j is also present. The denominator states on the contrary that if (ij) is absent, either none or two of the 
edges of di \ j are present. 



The normalization constants of the marginals are easily computed, 

Ci = 1 + uyi^j yj-,i , Ci = 1 + ^ y^^i yw^i , (13) 

k,k' Gdi 

and one can check, using the BP equations, that the consistency conditions are indeed respected by these expressions of 
the marginals. Moreover the value of -FBotho at its minimum can be expressed in terms of the normalization constants 
Ci and Ci. Using this value as an approximation for — InZ(u) the free energy in the Bethe approximation can be 
written as: 



M N 



(14) 
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FIG. 2: Left: the leaf removal procedure interpreted in terms of null messages. Right: the behaviour of messages along chains 
of degree 2 vertices. 

One should also compute the length of the circuits in the configurations selected at a given value of m, £{u) — uf'{u). 
It is rather unwise to use Eq. (|14(l to compute the derivative f'{u), as this expression involves the messages which 
are solution of the BP equations and hence have a non trivial dependence on u. On the contrary the expression Hl()|l 
being variational, it is enough to compute its explicit derivative with respects to u to obtain: 



The first equality is natural, the average length of the circuits being equal to the sum of the probabilities of presence 
of all the edges of the graph. Note also that the marginal probabilities contain individually some local information: 
for instance pi{l) is the fraction of circuits of length £{u) which go through the particular link I. 

Let us now come back for an instant on the BP equations and underline two simple properties they possess. In 
Eq. (|12|l we used the natural convention that sums on empty sets are null. The first consequence is that yi^j = if 
J is the only neighbor of i, as 9i \ j = 0. In such a situation i is indeed a leaf of the graph, and no circuits can go 
through the edge (ij). Even if in general the directed message in the reverse direction yj^i is non zero, one can easily 
check that the edge {ij) does not contribute to the free energy. In other words the physical observables are unaffected 
by the leaf removal process, in which the graph G is deprived of the dangling edge (ij). Moreover this simplification 
can be iteratively repeated, until no leaves are present in the remaining graph. An illustration of this process in terms 
of the null directed messages is given in the left part of Fig. |5| This property of the BP equations reflects the fact 
that the circuits of a graph are necessarily part of its 2-core, that is to say the largest of its subgraphs in which all 
sites have connectivity at least 2. 

Consider now a site i with two neighbors j and fc, for which the BP equations read yi^j = uyk—>i and yt^k = uyj^i. 
This implies that along a chain of degree 2 sites, the directed messages follow a geometric progression, cf. the right 
part of Fig. |21 and in consequence one easily shows that the marginal probability of all edges in a chain are equal: if 
a circuit goes through one of the edges of the chain, it must go through all of it. 



The presentation of the Bethe approximation in terms of messages j23l we followed in the previous section suggests 
in a very natural way the following algorithm for the approximate counting of long circuits in a given graph: 

• initialize messages yt^j for each directed edge of the graph to some random positive values. 

• iterate the BP equations (|12l) at a given value of u until convergence has been reached. 

• using the messages solution of the BP equations, compute f{u),i{u) from Eqs. (|14I15|I . and a{£{u)) = f{u) — 
£{u)\nu (cf. Eq. ®). 

• repeat this procedure for different values of u to obtain a plot of <7{i) parametrized by u. 




(15) 



B. An approximate counting algorithm 
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This algorithm is of course far from being exact. A first limitation is that the BP equations are not a priori 
convergent, on the contrary it is easy to construct counter-examples of small graphs on which they do not reach any 
fixed point. It would thus be interesting to determine under which conditions the convergence towards an uniq ue j^ non- 
trivial) fixed point is ensured. This kind of question has been the subject of recent interest, see for instance 24, "251. 
Another possible criticism is that even in the case of convergence of the BP equations, the prediction for the number 
of loops relies on the Bethe approximation, which is an uncontrolled one. This being said, one should however keep 
in mind that for large graphs with numerous circuits, an exact enumeration 12] is computationally very expensive 
and reaches very soon the limitations of present time computers. The approximate algorithm we introduced here can 
then serve as an efficient alternative, even if its predictions should be treated with caution. 

We presented in the results of such a procedure when applied to a real-world network of the Autonomous 
System Level description of the Internet [2^, allowing to estimate the total number of circuits, the length of the 
most numerous circuits and the maximal length circuits, obtaining numbers which are far beyond the possibilities 
of exhaustive counting. We also checked the compatibility of our results with the direct enumeration of very short 
circuits. 



III. THE TYPICAL NUMBER OF CIRCUITS IN RANDOM GRAPHS ENSEMBLES 

A. Definitions 



The rest of the paper shall be devoted to the study of the number of circuits in graphs G belonging to random 
ensembles. In the regime we are interested in (long circuits of large graphs with finite mean degree), the common 
wisdom about the statistical mechanics of disordered systems is that the random variable \og{NL=Ni)/N should be 
concentrated around its average, the quenched entropy crq(£). More formally, one expects the existence of a constant 
^max G [0, 1] and a function (J(^[^) > defined on ]0,£inax] such that for any sequence with L^/N £, 



if e > £„iax , Prob[AAL„ > 0] ^ , (16) 

1 

N 



if^e]0,Cax], Ve>0 Prob 



— l0gAAi„ -fTq(£) 



> £ 



(17) 



In the second line log(O) should be interpreted as ~oo, i.e. outside of any finite interval. 

The standard probabilistic methods for proving this kind of results rely essentially on the combinatorial computation 
of the average and variance of A/l, which are then used in the Markov and Chebyshev inequalities (first and second 
method). The rigorous results on the number of circuits in regular random graphs ISSSUl have indeed be obtained 
through a refined version of the second moment method (see theorem 4.1 in '^). In this context this approach is 
limited to cases where the second moment of Al is not exponentially larger than the square of its first moment^. The 
quenched entropy is then shown to be equal to the annealed one, CTa(^) = lim log{AfiN) / N , where the overline denotes 
the average over the random graph ensemble. 

We believe that in all ensembles of graphs which are not strictly regular and have a fastly decaying connectivity 
distribution, the second moment of the number of long circuits is exponentially larger that the square of the first 
moment (details of the combinatorial computations leading to this belief are presented in App.^l, thus ruling out the 
main probabilistic techniques used so far. The annealed entropy [T9l | is in this case strictly larger than the quenched 
one, as it is dominated by exponentially rare graphs which have exponentially more circuits than the typical ones. 

We shall now follow the cavity method [2Qi |. which is particularly well suited to tackle this problem, ubiquitous in 
the field of disordered statistical mechanics models According to this view, the quenched entropy controlling 

the leading behaviour of the number of circuits in the typical graphs depends on the graph ensemble only through 
its limiting degree distribution qk We can for instance assume that the graphs are drawn uniformly among all 
graphs on vertices which have this degree distribution. Let us recall the existence in this case of a percolation 
transition |29l l30l | between a low connectivity regime where the connected components of the graph are essentially 
trees of finite size, to a percolated phase where one giant component contains a finite fraction of the vertices. 



In a different pro blem , namely the random ensemble of fc-satisfiability formulae, this limitation has been overcome by a weighted second 
moment method l27t . 

^ This is not true for the annealed entropy which depends on the "microscopic details" of the ensemble. For instance the two classical 
ensembles G{N,p = c/N) and G{N, M = Nc/2) have the same Poisson degree distribution but distinct annealed entropies, see SecfyTTl 
and App. ^ 



7 



Before proceeding with the computations, we introduce some notations used in the following, c = kqk denotes 
the mean connectivity of the graph, hence the number of edges is in the thermodynamic limit M — Nc/2. will 
be the offspring probability, that is to say the probability of finding a site of degree fc + 1 when selecting at random 
an edge of the graph and then one of its two vertices. As a site is encountered in such a selection with a probability 
proportional to its degree, qk is proportional to (fc + 1)^^+1. By normalization, 

= i^i±l)Mi . (18) 
c 

To simplify notations we shall also define the factorial moments of qk and qk as 

OO OO 

Mn = 'Ylk k{k - 1) . . . (fc - n + 1) , /i„ = ^ q^fc fc(fc - 1) . . . (fc - n + 1) , /z„ = c/i„_i , (19) 

k—n k—n 

where the last relation is a simple consequence of Eq. H18|) . 

The condition for percolation 29, 30] reads with these notations /ii > 1. We shall assume in the following that this 
condition is met: the long circuits we are studying cannot be present if the graph has no giant component. 

We restrict ourselves to fastly (i.e. faster than any power law) decaying distributions of connectivities, such that 
all their moments are finite. After stating the results for arbitrary qk we shall often specialize to Poissonian graphs 
of mean connectivity c, i.e. such that qk — e^'^c'^/fc!. 



B. The quenched computation 

In essence the computation of the quenched entropy we undertake now amounts to perform the Bethe approximation 
of the statistical model defined by Eqs. (|1I2(I for graphs generated according to the connectivity distribution qk- The 
solution of the BP equations H12|) . which depends on the particular graph on which they are applied, leads then to a 
random set of messages y. Taking at random a graph of the ensemble, and a directed edge of this graph, one finds a 
message y with probability distribution P{y; u). In the so-called cavity method at the replica-symmetric level |20|, one 
assumes that the incoming messages on this directed edge are independent random variables with the same probability 
law P(y;u). Using Eq. (|12|l . this is turned into a self-consistent equation, 

/•DO 

P{y;u) = qoS{y) + y2ik / dyiP{yi]u)...dykP{yk]u)5{y-gk{yi,.-.,yk)) , (20) 
fe=l ^0 

where we have defined 

k 

gi{yi) = uyi, 5fe(yi, ■•• = -r- — o '"v- for fc > 2 . (21) 

i + u^ L yiyj 

l<i<j<k 

The quenched free-energy is then expressed in terms of this P{y; u) as (cf. Eq. I|14|l l: 

QO poo j 

/q(") ^ dyiP{yi;u)...dykP{yk-.u)ln\l + u^ ^ yiy^ 

k=2 y l<i<i<fc 



dyiPiyi;u)dy2Piy2;u) ln(l + ^2/12/2) , (22) 



In a similar way the length of the circuits in the configurations selected by a given value of u, and the corresponding 
quenched entropy read: 

l{u) ^ ^ dy,P{y,;u)dy2P{y,;u) ™ , (23) 

2 Jo 1 + uyiy2 

^qW«)) = Uu)~£{u)\iiu . (24) 
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As appears clearly when considering Eq. H20|l . the distribution P{y; u) contains a Dirac's delta in y = 0, that is to 
say a finite fraction of the messages are strictly vanishing. Let us call r] the fraction of non-trivial messages^, and 
P{y; u) their (normalized) distribution, i.e. P(j/; u) = {1 — ij) S{y) + 1] P{y; u) where P does not contain a Dirac's delta 
in y = 0. Inserting this definition in Eq. (|20|) . one obtains the equation satisfied by rj, 

OG 

fc=0 

Besides the trivial solution jj — 0, this equation has another positive solution as soon as //i > 1, i.e. when the graph 
is in the percolating regime. One also realizes that P satisfies the equation obtained from Eq. (|20|l by replacing the 
offspring distribution q by f, defined as 

ro = 0, rfc = ^^g„(^^j77'=-i(l~77)"-'= for fc > 1 . (26) 

Finally the free-energy and the typical length of circuits (cf. Eqs. I|22l23|l ') can also be expressed in terms of the 
simplified distribution P, if one replaces q by the following distribution r: 

ro = l-^rfc, ri=0, = qJl)vH^ - vT'' for fc > 2 . (27) 

k>2 n=k ^ ^ 

It is easily verified that this modified distribution has mean cry^ , and that a relation similar to Eq. (|f 8(1 holds between 
r and f, 

(fc + I)rfc+i 

Tk = 5 . (28) 

Let us now give the interpretation of this simplification process. We have shown the equality of the quenched entropy 
of the circuits in the two ensembles defined one by qt, the other by r^. As we explained at the end of Sec. m the 
circuits of a graph G necessarily belong to its 2-core, that is to say the largest subgraph of G in which all vertices 
have a degree at least equal to two. On the dangling ends, i.e. the edges that do not belong to the 2-core, at least one 
of the two directed messages y is equal to zero. It is thus very natural to interpret the elimination of null messages in 
terms of the typical properties of the 2-core of graphs drawn from the ensemble defined by the distribution q^ . The 
fraction of edges in the 2-core should be rf, as both directed messages have to be non-zero for the edge to belong to 
the 2-core, (resp. fk) should be the connectivity (resp. offspring) distribution of the 2-core. This interpretation 
is indeed confirmed by a direct study of a leaf-removal algorithm which iteratively removes the dangling ends of a 
graph, that we present in App. In the following we shall use the distribution q or r, depending on which is simpler 
in the encountered context. 

For future use we give the explicit expressions in the case of Poissonian random graphs, 

77=1-6-"', rfc = ;^'^> forfc>2, h = - for fc > I . (29) 

fc! 7] fc! 

We now come back to the predictions of the quenched entropy and consider as a first example the case of regular 
graphs of connectivity c, for which q^ — Sk,c-i- Equation H20() on the distribution of messages has then a very simple 
solution, P{y; u) — S{y — yr(u, c)), with 



2u(c-l)-2 

yr{u, c) = J — — — — . 30 

y u^(c- l)(c- 2) 

It is then straightforward to express £{u) and aq{i{u)) from this solution. One can also eliminate the parametrization 
by u to obtain the entropy, 

a,{e,c) = -{1 - e)\n{l -£)+ (^^-eyn(l-'^\ +e\n{c-l) , (31) 



* ri was denoted 1 - C in ISH. 
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FIG. 3; Quenched entropy for a Poissonian graph of mean connectivity 3. 

which corresponds to the known results mentioned above '31^^. 

The pecuharity of the regular case for which annealed and quenched averages coincide is hinted to by the simplicity 
of this solution P{y\ u) with a single Dirac peak. As soon as qk is positive for more than one connectivity, the 
distribution P{y; u) acquires a non vanishing support, which we expect to show up as larger fluctuations in the 
numbers of circuits, and hence a difference between quenched and annealed computations. 

The equation on P{y] u) is not solvable analytically for an arbitrary connectivity distribution. Two complementary 
roads can then be followed: this distributional equation can be easily solved with a population dynamics algorithm [20j. 
One represents P{y] u) by a sample of a large number of y's, at each time step one draws a number k following the law 
ffc, extracts k values yi, . . . ,yk randomly from the population, computes the new value gk{yi, . ■ . ,yk), and replace one 
of the representant of the population by this new value. Starting from a random sample of messages, the population 
converges to a sample of message distributed according to the fixed point solution of Eq. ((201 ■ The corresponding 
physical observables are then computed from this sample of messages, which yields the prediction for (Tq{£). As an 
illustration, we present in Fig. the results of such a numerical computation for Poissonian graphs of mean degree 
c = 3. 

On the analytical side, we present in the next sections a study of two limits, for the short and longest circuits, in 
which the analytical predictions can be pushed forward. 



T I I I r 




IV. THE LIMIT OF SMALL CIRCUITS 



We shall investigate in this section the behaviour of the quenched entropy in the limit of small circuits, computing 
analytically its first two derivatives at the origin, crq(O) and o'q(O). 

Let us first show the existence of a critical value Um below which the typical configurations are deprived of any 
circuit. This transition is signaled by an instability of the trivial solution of Eq. I|2Q|I . P{y) = 5{y). Perturbing this 
distribution infinitesimally, one can expand as (j/i , . . . , yj, ) — u^i^iyi + 0{yf). In this limit , if one inserts in the 
r.h.s. of Eq. (|20() some P{y) with an infinitesimal mean Pi, one obtains another distribution with a mean (u/uni)^'i, 
with 

oo 

= ^ A:gfe = /2i . (32) 

k=l 

If M < the mean of the perturbed distributions decrease upon iteration, so P{y) = 5{y) is a stable solution. On 
the contrary if m > this solution is unstable and must fiow to a non-trivial stable fixed point. 

We shall now set up an expansion around the stability limit, u — + e with e ^ 0^. In this limit the messages 
y are supported on a scale which vanishes with e, we shall consequently define y = xe" + o(e'') with x finite, and a a 
positive exponent to be determined in a few lines. Let us denote Q{x) the distribution of the rescaled messages and 



^ These results are also a particular case of the study of polymers on regular graphs presented in |32l . 
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relate the moments of P and Q as 

poc poo 

Pn{e) = / dyy'^Piyiu^ + e) ~ e"'^ / dxx'-Q{x) = e^^Qr, . (33) 
Jo Jo 



Expanding Eq. (PT|l as 



X ^ u^'^Xi + e'^Xi- ul^e'^"' i'^Xij ^ x^xj \ , (34) 

i=l i=l \i=l I \\<i<j<k j 

we obtain at the lowest order 

e = </i2Q2e'" + i<A3Q?e'" , (35) 
Q2e'" = UmQ2e'" + <Ai2Q?e'" • (36) 
This fixes the scale a — 1/2 and the values of Qi and Q2, the first two moments of the distribution solution of 

k 



^x, 

fe=i -^0 V i=l 



(37) 



Let us now consider the consequences of this scaling on the observables /q, £, <Tq in the limit u w,„. Taking into 
account both their explicit dependence on u and the implicit one through the distribution P{y;u), one finds after a 
short computation that: 

• the expansion of /q(um + e) starts at the second order, 

/q(wm + e) - fi'^e' + O(e') , (38) 

4'^ = ^Ql + l<{^ - "m)Q2 - ^</^3Q?Q2 - l<f^4Qt . (39) 



the intensive length £ is, at its first non-trivial order. 



£{u^ + e) = £^^h + 0{e^) , £^^'> = -u^,Q\ . (40) 



• the first two derivatives of aq(£) in £ = can be obtained from the previous expressions: 



(2) 

<(0) = - ln«„, , <(0) = 1^ - ^ (41) 



Solving for Qi^2 and plugging their values in the expression (|41|l of the derivatives of the entropy one finally obtains: 

<(0) = 1„A., <(0).4(| + ^;^.) ,42, 

We can now turn to the discussion of these results, and in particular to the comparison with the annealed com- 
putation of Bianconi and Marsili 19]. Expanding their result (reproduced in Eq. (jAlGp ) in powers of £, one obtains 

c^a(O) = In Ml : cra(O) = ^2 • (43) 

C /Xj 

Let us first consider a large but non extensive circuit length, 1 <C i <C InA''. The number Ml of such circuits is, in 
the thermodynamic limit, a Poisson distributed random variable with a mean equal to 
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When L ^ 1 the most probable value of this random variable is equal to its mean, in which one can neglect the 
polynomial prefactor 1/(2L) (we recall that we assume /ii > 1 to be in the percolated regime). Consequently the 
quenched and annealed computation of the first derivative of the entropy at ^ = coincide and match the result for 
1 < L < InA^: 

A/i ~ e^-^^^/^) ~ e^'^'(o) = (Ai)^ . (45) 
On the contrary the second derivatives differ, in general, in the two computations: 

'^a(O) - <'(0) = ^377^-^(^2 - " 1))' • (46) 

As expected the annealed entropy is always greater than the quenched one at this order of the expansion. Moreover 
it is straightforward to show from the above expression that cr"(0) — cf'^{0) vanishes only if the distribution is 
supported by a single integer, in other words in the random regular graph case. 

We performed this computation using the degree distribution of the graph, however the reader will easily verify 
that Eq. 142(1 remains unchanged if one replaces qk by the connectivity distribution rk of its 2-core (the factorial 
moments /i„ gets multiplied by ?7"~^, the mean connectivity c by 77^). 

For completeness we state here the results for Poissonian graphs of average degree c, 

<(0) = lnc, a'^iO)^-^. (47) 



V. THE LIMIT OF LONGEST CIRCUITS 



A more interesting limit case is the one of maximal length circuits. Some questions arise naturally in this context: 
what is the maximal length, ^max, for which circuits of A'^^max edges are present with high probability in a given 
random graph ensemble? In particular, under which conditions these graphs are Hamiltonian, that is to say £max = 1? 
Finally, what is the number of such longest circuits, measured by the corresponding quenched entropy crq(^max)? From 
the properties of the Legendre transform (cf. Eq. I@J), these quantities can be determined by investigating the limit 
u — !■ 00 of the free-energy : 

fq{u) CaxlnU + (Tq(£,nax) • (48) 

u — *oo 

This corresponds, in the jargon of the statistical mechanics approach to combinatorial optimization problems, to a 
zero temperature limit, where — ^max (resp. crq(^niax)) is the ground-state energy (resp. entropy) density. 

It turns out that the answers to the above questions crucially depend on the presence or not of degree 2 sites in the 
2-core of the random graphs under study, we shall thus divide the rest of this section according to this distinction. 
Before that we state an equivalent expression of the free-energy which will prove more convenient in this limit, 

oo /I I. _ 9 \ 

/q(z.) = ^g,. -/fe_i(u)-— -4(u) , (49) 

k=3 ^ ^ 

where we have defined some logarithmic moments of the distribution P, 



Ik{u)^ ( dyiP{yi;u)...dykP{yk;u)ln\l + u'^ ^ y^yj] for fc > 2. (50) 

V i<^<.<fc / 

This form of fq{u) is obtained from Eq. H22|l by using the equation ((2()|l on P{y; u) and the identity 

1 + yiVj 

l + uyo9kiyi,---,yk) = — — 2 ■ (51) 

l<i<j<k 
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^q(l) 




FIG. 4: Quenched entropy for the Hamiltonian circuits in graphs with a fraction e of degree 3 vertices, and 1 — e of degree 4. 

A. In the absence of degree 2 sites in the 2-core 

One can gain some intuition on the limit u ^ oo by inspecting the behaviour of the messages in the regular case. 



Indeed, the expansion of Eq. (|30|l shows a scaling of the form y 



xu 



-1/2 



, with X (an evanescent field in the jargon 



of optimization problems) finite. Consider now the more general case of random graph ensembles with a minimal 
connectivity of 3, i.e. go = 9i = (72 = 0, which consequently implies qo — qi = 0. Thanks to the vanishing of qi, the 
equations (|20I21|I have a solution with the above scaling of y with u. One can also check numerically in particular 
cases that the distributions P{y) concentrate according to this behaviour for large but finite values of u. 
Denoting Vq{x) the distribution of the evanescent fields, one easily obtains the integral equation it obeys: 



Vo{x) ='^qk dxi 
fc=2 



Vo{xi) . . .dxkVQ{xk) 5{x - hk{xi,. . . ,Xk)) , with hk{xi, . . . ,Xk) 



k 



(52) 



l<i<j<fc 



Moreover the logarithmic moments defined in Eq. H5U|I have the following scaling in this limit, 



Ik{u) ^ \nu + Jk , with Jk = i dxi Vo(a;i) . . . dxk Volxk) In 



, l<i<j<fe 



(53) 



Plugging this equivalent in the expression H49|) for the quenched free-energy, and identifying the maximal length of 
the circuits with the coefficient of order Inu, and their entropy with the constant term, we obtain: 



^q(l) 



k=3 



qk 



k k- 

^Jk-1 - — 



-Jk 



(54) 



The identity ^max = 1 (of which we present an alternative derivation in App. Q reproduces the conjecture of 
Wormald (conjecture 2.27 in (31) that random graphs with a minimum connectivity of 3 arc, with high probability, 
Hamiltonian. Obviously the methods we used are far from rigorous and do not provide a valid proof of the conjecture. 
However they give it a quantitative flavour with the prediction of (Tq(l), the typical entropy of such Hamiltonian 
circuits. We performed a numeric resolution of Eq. (|52|) . again by a population dynamics algorithm, to compute the 
moments Jk and from them the quenched entropy crq(l). As an illustrative example. Fig. 0] presents the results of 
such a computation in the case of random graphs with a fraction e of degree 3 vertices, and 1 — e of degree 4. As a 
function of e the entropy interpolates between the rigorously known values at e = 0, e = 1, for which the graphs are 
regular. Note that the quenched and annealed entropies, even if strictly different when < e < 1, are found to be 
numerically close. For instance when e ~ 0.5, one has (Jq{l) ~ 0.2489 and cra(l) ~ 0.2501. 
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FIG. 5: A single step of the simplification algorithm (from G2 to G3) used in Sec. IV B1 



B. In the presence of degree 2 sites in the 2-core 



Let us now consider the question of the longest cycles in random graph ensembles with connectivity distribution 
qk which do not fulfill the condition go = 9i = 92 = we assumed in the previous subsection. We first present simple 
combinatorial arguments which lead to bounds on £niax and to an asymptotic expansion when there are very few 
degree 2 sites in the 2 core, before coming back to the limit u ^ 00 of the cavity approach. 



Let us call Gi a graph drawn at random from such an ensemble, and G2 its 2-core, determined for instance by the 
leaf removal algorithm detailed in App.^ G2 has the connectivity distribution defined and computed in Sec. lIIIBl 
and App. The number of sites in the 2-core is TV^corc (cf. Eg. IB9|) . with ^corc < 1 unless the original graph was 
deprived of any isolated sites and of leaves (i.e. go = 91 =0). This ^corc is clearly an upper bound on 4nax, as circuits 
cannot be longer that the number of available sites in the 2-core. 

One can derive a lower bound on ^max with the following reasoning. From G2, the 2-core of Gi, eliminate recursively 
sites of degree 2, identifying the two edges which were previously incident to it (see Fig.[SJ). When all sites of degree 2 
have been removed, one ends up with a graph, call it G3, on iV(^core — ?"2) sites, where the minimal connectivity is 3. 
Using the result of the previous section, this reduced graph typically contains circuits of length iV(£coio — »'2)- Each 
of the circuits of G3 can be unambiguously associated to a circuit of G2 , reinserting the edges which were simplified 
during the construction of G3. Obviously the reconstructed circuits of G2 are longer than the ones of G3, hence 
^ib = (^corc — ^'2) should be a lower bound for i?niax- These bounds have been used under a stronger form in the case 
of Erdos-Renyi random graphs very close to the percolation threshold (for c = 1 + 6 with N~^^^ <C (5 <C 1) in 

One can then wonder if the upper bound is saturated, in other words if the 2-core is Hamiltonian. In general the 
answer is no, as explained by the following remark. Consider a site of degree strictly greater than 2, surrounded by at 
least three neighbors of degree 2: obviously, no circuit can go through more than two of these neighbors. As soon as 
r2 > there will be an extensive number of such forbidden vertices, hence in such a case £max < ^core- The equality 
is possible only if r2 — 0, which was the case investigated in the previous section. 

Note that the gap between the lower and upper bound closes when r2 vanishes, as the 2-core becomes Hamiltonian 
in this limit. A conjecture on the behaviour of ^max in the limit r2 — > can be formulated as 



This expression has a very simple interpretation: a forbidden site in the above argument appears if a vertex of degree 
greater than 3 is surrounded by at least three vertices of degree 2. At the lowest order these forbidden sites are far 
apart from each other, iV^max is thus reduced by one unit each time this appears. This conjecture will come out of the 
cavity analysis of next subsection, we preferred to anticipate it here because of its simple combinatorial interpretation. 

Let us exemplify the bounds and the conjecture in two particular cases. Consider ensemble of graphs where a 
fraction 1 — e of vertices have degree cq > 3, the others being of degree 2, with < e < 1. The above bounds and 
estimation read 



1. 



Bounds on £, 



max 




(55) 



max _ 



< 1 , e, 



max 



= 1 - 



4(co-l)(co-2) 
3c§ 



+ 0(6^) . 



(56) 
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As a second example we consider Poissonian random graphs of mean degree c, for which the bounds read (cf. 
Eq. 121) 

- 1 - (1 - ?/) (^1 + cr/ + i (cTjf^ , 4orc - 1 - (1 - ?7) (1 + cr?) , (57) 

where rj is the solution ofrj = 1 — exp(— c??). In the limit c — > oo the fraction of degree 2 vertices in the 2-core vanishes, 
the above conjecture reads then 

Cax = 1 - (c + l)e-= - c2e-2- _ ^ (^^ + 3c - 1^ g-^- + ©(c^e"*-) , (58) 

where n is some positive integer. Most of these terms come from the expansion of ^oro, the only non-trivial one being 
— c^e~^^/6. This is in agreement with a rigorous result of Frieze [T(ll |. 

1 - (1 + (5(c))ce-= < ^max < 1 - (c + l)e-= with S{c) . (59) 



2. The large u limit in presence of sites of degree 2 

We come back to the cavity approach and investigate the large u limit in this case. The simple ansatz y ~ xu~^^^ is 
not compatible with Eqs. (|20I21|I any longer, because of the non vanishing value of fi. The most natural generalization 
which allows to close this set of equations is then y ^ xu^"^/^, where p is a relative integer (hard field). We denote 
Vp{x) the probability distribution of the evanescent fields x associated to hard fields p. For notational simplicity 
we take the Vp to be unnormalized, with dxVp{x) — Vp, and impose the condition X^pez ^ ^- Expanding 
Eqs. (|20I21|I with this ansatz, one finds 



oo poo 

Vp{x) = '^rk ^ ^P,ek{pi,..;Pk) dxiVp^{xi) . . .dxkVp^{xk)5{x - hk{pi,xi 
fe=i pi.^.^pkez" •'^ 



,Pk,Xk)) . (60) 



In order to simplify the expression of Ck and hk, we shall denote [n] a permutation of the indices which orders the 
hard fields in decreasing order, p^ij > p[2] > . . -Pm- Then 

ei(pi) 1 +Pi , efe(pi, . . . ,p/c) = niin(l -p[2]) for /c > 2 . (61) 

We also define 

{1 ^ if p[i] +p[2] < -1 

I + dkipi,xi, . . . ,pk,Xk) if +P[2] = -1 , (62) 
dk{pi,xi, . . . ,Pk,Xk) if P[i] +P[2] > -1 

J2 XiXj if = P[2] 

dk{pi,Xi,...,pk,Xk) = \ „ " -r ^ ' (63) 

X[l] L X^ II Pll] > P[2] 

i\Pi=Pl2] 



in terms of which the evanescent contribution reads 



E 



hi{pi,xi) = xi , hk{pi,xi,...,pk,Xk) = -r-, r for A; > 2 . (64) 

dk{pi,xi, . . . ,Pk,Xk) 

Within this ansatz the logarithmic moments (cf. Eq. 15UI) ) behave as 

Ik{u) ^ jj^hnu + Jk , (65) 



4^^= X! ""pi ■ ■ •''^Pfc max(l +P[i] +P[2],0) , 
Pi,...,Pfcez'= 



(66) 
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/•OO 

Jfe = y2 / dxiVp^{xi) . . .dxkVp^{xk) IndkipiyXi, . . . ,pk,Xk) ■ (67) 
Pi,...,PfceZ'' 

From the behaviour of /q we thus obtain 

l^.. = E Jfj, - ^ ^) , aq(Ca.) = E J.-i - ^ J.) ■ (68) 

fc=3 ^ ^ fc=3 ^ ^ 

Obviously this set of expressions reduces to the ones of Sec. IV Al when all the hard fields are null, which is a solution 
if and only if fi =0. 

Let us first discuss the computation of ^!max- This quantity can be obtained from the distribution Vp of the hard 
fields, independently of the evanescent ones. Integrating away the evanescent fields in Eq. (|60|l . one obtains: 

OO 



fe=l 



Besides the population dynamics method, a faster and more precise method can be devised to solve this equation. Let 
us for this purpose define Wp, the integrated form of the distribution, and ^p{x) the generating function of the f^'s: 

p— 1 OO 

= E ' "^(^^ " X! ^^^^ ■ c^^) 

p' — — OO k—1 

One can then rewrite Eq. (|69|) . after a few lines of computation based on the expression of ek{pi, . . . ,pk), under the 
form 

U^,=l^{l^WpWiW p) ^^^^^^ ^^^^ 

yw-p 1 - (1 - Wp+ijtp (Wp+i) + tp[W-p-lj - ip{Wp+lj 

It is easy to solve them numerically by iteration (both W-p and 1 — Wp vanish exponentially fast when p — > +oo, a 
cutoff on p can thus be safely introduced), and to deduce £max from the solution Vp (see Eqs. (j66l68ll V We present 
the results of this procedure for Poissonian graphs in the left panel of Fig. El along with the bounds discussed above. 

We now sketch the way to compute the expansion of £niax stated in Eq. (|55|l . In the limit fi —> 0, the distribution 
Vp tends to Sp^Q. A more precise inspection reveals that Vp = O(f^), V-p = 0{fi^) for p > 0. In order to obtain 
Eq. it is thus enough to find {v-i,vo, vi,V2, v^} at order ff, in function of the connectivity distribution f^. The 
result follow by collecting the terms of order in Eqs. Ht)6l68|l . This expansion could be in principle pursued at any 
higher order, at the price of more tedious computations. 

If one is not only interested in the length of the longest circuits, but also in the associated entropy crq(£max), one has 
to solve the complete equation (|Hn|) on the distribution of both hard and evanescent fields. This is easily done by a 
population dynamics algorithm, following population of couples {p, x), see the right panel of Fig.|2|for the results in the 
Poissonian case. However, when the value of ri is too large, there appears an instability in the resolution of Eq. (j^UJ). 
For the sake of definiteness let us consider the Poissonian case and postpone a more general discussion to the next 
section. For values of c larger than a critical value Cs^-* ~ 2.88, the evanescent fields distributions converge, whereas 
below ci'^\ the iteration brings some of them towards diverging or vanishing values. The origin of this instability can 
be traced back to the behaviour of the original messages y at large but finite u. A closer inspection of numerically 
obtained histograms of the y's reveals that in this limit they indeed obey a scaling of the form y ^ xu^~^^'^, but p 
is a relative integer only for c > ci'^\ For lower connectivities, a continuously growing fraction of the hard fields are 
half-integers. This fraction reaches one at ci"'' ~ 2.67, below which all p's are half- integers. If one allows the hard 
fields to be both integers and half-integers in Eqs. I|6()I66I67II . this instability problem is cured, which allowed us to 
obtain the (dashed) low connectivity part of the curves in Fig. We shall come back in the next section on the 
interpretation of this phenomenon. 



VI. STABILITY OF THE REPLICA-SYMMETRIC ANSATZ 



The cavity computations we have presented so far were based on the assumption of replica symmetry (RS), valid 
if the space of configurations is smooth enough. In disordered systems this assumption can be violated, we shall thus 
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FIG. 6: The length of longest cycles in Poissonian graphs of mean connectivity c (left) and the associated quenched entropy 
(right). Dotted lines in the left panel are the bounds of Eq. H57|l . The dashed part of the curves corresponds to the regime 
c < ci'^\ where we expect the replica symmetry to be broken. The inset of the right figure shows a blow-up for small 
connectivities, the replica symmetric entropy presents a local maximum (resp. minimum) at c « 2.48 (resp. c « 2.71). 



investigate its validity in the present model. More precisely, we consider the local stability of the RS ansatz in the 
enlarged space of one step replica symmetry breaking (IRSB) order parameters |23] (we leave aside the possibility of 
a discontinuous transition). In the IRSB setting, the messages y are replaced by probability distributions Q{y) over 
the states, and the recursion y ^ cjkiyi, ■ ■ ■ , yu) becomes 

Q{V) ^^J dyiQiiyi) . . . dykQkiVk) 5{y - gk{yi, . . . , yk)) W(yu ykT , (72) 

where Z is a normalization constant, m is the Parisi IRSB parameter, and W{{yk-ti}) is a reweighting factor whose 
explicit form is not needed here. The distributions Q arc themselves drawn from a distribution over distributions, 

Q{Q]- 

The replica symmetric solution studied in the main part of the text is recovered by taking the distributions Q 
concentrated on a single value y. To investigate its local stability, one gives them an infinitesimal variance v. Expanding 
Eq. 172|l in the limit of vanishing w's, one obtains the following relation: 



k 



ddkivi, ■■■,Vk 



{y.v)^ \gk{yi,...,yk),} \ tt. ■ (73) 



For the RS solution to be stable against this perturbation, the variances of the IRSB order parameters should decrease 
upon iterations of the above relation. This can be studied numerically for any random graph ensemble, by iterating 
the above relation on a population of couples {y,v), the value of k being drawn from fk- The variances v can be 
initially all taken to 1 (note that Eq. (|73|l is linear in the w's), in the course of the dynamics the w's are periodically 
divided by a number A, chosen each time to maintain the average value of v constant. After a thermalization phase A 
converges (in order to gain numerical precision one computes the average over the iterations of In A) , its limit being 
> 1 (resp < 1) if the RS solution is unstable (resp. stable). This method, pioneered in the context of the instability 
of the IRSB solution in 's^l, can be replaced by the computation of the associated non-linear susceptibility, see for 
instance |33 |. 

For regular random graphs of connectivity c, where all RS messages take the same value y-[{u, c) given in Eq. (j^OJ, 
one can readily compute the value of the stability parameter, 

^ / N (2-"(c-l))^ 
M^(C — 1)'' 

It is easy to check that < Ar(w, c) < (c — 1)~^ < 1 when u lies in its allowed range [um — {c— 1)^^, oo[, confirming 
the validity of the RS ansatz. It would have been anyhow surprising to discover an instability in this case where the 
annealed computation is exact. 

Another case which is analytically solvable is the limit ^ (i.e. u u^). Indeed, we have seen that the messages 
scales then as x{u — Um)^, and it turns out that dgk/dyi — > Um, independently of the rescaled messages x. Recalling 
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FIG. 7: Left: the stability parameter A for Poissonian random graphs, from left to right c = 1.2, 2, 3. Right: Sketched 
behaviour of the quenched entropy for generic families of random graphs. From top to bottom a control parameter drives the 
graphs towards a continuous percolation transition, the maximal length of the circuits is reduced. In the neighborhood of the 
percolation transition replica symmetry breaking takes place for large enough circuits, and should be taken into account to 
compute the dashed part of the curve. 

that "^k^kk — jli — ^ one finds A = j^i^^ < 1 in this limit: for any connectivity distribution, the RS ansatz is 
always stable in the small i regime. 

All the numerical investigations of A we conducted for ensembles with minimal connectivity 3 suggest that in this 
case the replica symmetric solution is stable for all values of u. Note that here the zero temperature limit of A can be 
studied directly at the level of evanescent fields, as dgk/dyi dhk/dxi. We thus conjecture that the whole function 
crq(£) computed with the RS cavity method is correct for these ensembles, and in particular the quenched entropy of 
Hamiltonian circuits o'q(l) stated in Eq. (|54|) . 

The situation is less fortunate for Poissonian graphs. The reader may have anticipated the appearance of non 
integer hard fields in the zero temperature limit for mean connectivities lower than Cs"*^^ as an hint of RSB. The datas 
presented in the left panel of Fig. shows indeed that for small c, the stability parameter A crosses 1 when u is 
increased above some finite value Us{c). This critical value of u increases with the mean connectivity, and an educated 

guess makes us conjecture that it diverges at ci~^^ . The rightmost curve for c = 3 shows indeed A < 1 for all the values 
of u we could numerically study. A precise extrapolation of Us(c) turned out however to be rather difficult. Note 
also that the study directly at m = oo is largely complicated here by the fact that the hard fields do not take a finite 
number of distinct values as is often the case in usual optimization problems [35l | , but extend on the contrary on all 
relative integers. In summary, the conjectured scenario is that at high enough connectivities the whole curve crq(^), 
and in particular its zero temperature limit, is correctly described by the RS computation. For lower connectivities 
there will be a critical length above which replica symmetry breaks down. We also believe that this scenario, sketched 
in the right part of Fig.[7| is valid not only in the Poissonian case, but for all families of random graph ensembles (with 
a fastly decaying connectivity distribution) with a control parameter which drives the graphs towards a continuous 
percolation transition, the fraction of degree 2 sites in the 2-core growing as the transition is approached. 

Let us finally propose an interpretation for the occurrence of replica symmetry breaking for the largest circuits in 
presence of a large fraction of degree 2 sites in the 2 core, by relating it to an underlying extreme value problem "3(|. 
In the discussion of Sec. IV B II one could indeed tag the edges I of the reduced graph G3 with a strictly positive 
integer, by counting the number of edges of G2 which were collapsed onto I. The length of a circuit of G2 is thus the 
weighted length of the corresponding circuit of G3, i.e. the sum of the labels on the edges it visits. These weighted 
lengths are correlated random variables, because of the structural constraint defining a circuit: for a given graph G3, 
not all the sums of L tags correspond to circuits of length L. When the fraction of degree 2 site is small enough, 
these correlations are sufficiently weak for the RS ansatz to treat them correctly, when long chains of degree 2 vertices 
become too numerous they somehow pin the longest circuits, which cluster in the space of configurations and cause 
the replica symmetry breaking. 
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FIG. 8: Left: Annealed and quenched entropies for the Erdos-Renyi ensembles G{N,p) and G{N,M) of mean connectivity 
c = 3, for graphs of size — 36, computed from the mean and the median of TVi, on samples of 10000 graphs. Right: the 
quenched entropy for G{N, M) a.t N = 36, and N = 54, symbols are the extrapolation in the limit N oo from several values 
of A'", solid line is the replica symmetric cavity computation. 

VII. EXHAUSTIVE ENUMERATIONS 

We present in this section the results of the numerical experiments we have conducted in order to check our analytical 
predictions. These experiments are based on the exhaustive enumeration algorithm of jl2| | which allows to generate 
all the circuits of a given graph G, and in particular to compute the numbers Nl{G) of circuits of a given length. 
This algorithm runs in a time proportional to the total number of circuits, hence exponential in the size of the graphs 
for the cases we are interested in, which obviously puts a strong limitation on the sizes we have been able to study. 

Let us begin with the investigation of the Erdos-Renyi ensembles G{N,p) and G{N,M). In the former, each of 
the N{N — l)/2 potential edges between the N vertices of the graph is present with probability p, independently of 
each other, in the latter a set of M among the N{N — l)/2 edges is chosen uniformly at random. With p = c/N 
and M = c7V/2, these two ensembles are expected to be equivalent in the large-size limit. In particular the vertex 
degree distribution converges in both cases to a Poisson law of mean c, the cavity computation thus predicts that their 
typical properties should be the same in the thermodynamic limit. This is not true for the the annealed entropies 
(Ta(^; N) = \og{AfiN)/N which are easily computed exactly even at finite sizes, see App.^ and which remain distinct 
in the thermodynamic limit. In the left part of Fig. |S1 we present the annealed and quenched entropies for both 
ensembles, computed from 10000 graphs of size A'^ = 36 and mean connectivity c = 3. The finite size quenched 
entropy has been estimated using the median of the random variables Ml ■ The annealed entropies are very different 
in both ensembles (and in perfect agreement with the computation of App.0, and clearly different from the quenched 
ones. The striking feature of this plot is the almost perfect coincidence of the median in the two ensembles; this was 
expected in the thermodynamic limit, but is already very clear at this moderate size. On the right panel of Fig. |S1 
the quenched entropy is plotted for two graph sizes, along with its extrapolated values in the thermodynamic limit, 
which agrees with the cavity computation. 

As argued above, the difference between annealed and quenched entropies can be also seen in the exponentially larger 
value of the second moment of A/l with respect to the square of the first moment. This fact is illustrated in Fig. |51 

where the analytic computation of the ratio \og{N1/ Nl )/N presented in Add. IXI is confronted with its numerical 
determination. 



We also considered the largest circuits in each graph, of length Lmax and degeneracy A/l„^^, and computed the 
averages Lraa^/N and InA/L^ax/-^ for various connectivities. Their extrapolated values in the thermodynamic limit 
are compatible with the predictions ^max and (Tq(^niax) of the cavity method, within the numerical accuracy we could 

reach. This is true also for connectivities smaller than Cs^"*, where we argued above in favor of a violation of the 
replica symmetry hypothesis: the corrections due to RSB should be smaller than the numerical precision we reached. 

Another set of experiments concerned uniformly generated graphs with an equal number of degree 3 and 4 vertices. 
We checked that the probability for such graphs to be Hamiltonian converges to 1 when increasing their size. The values 
for the annealed and quenched entropies for the Hamiltonian circuits are too close to be distinguished numerically. 
However the study of the ratio of the first two moments of Nn (see Fig. I10|l indicates that they should be strictly 
distinct in the thermodynamic limit. 
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FIG. 9: Left: the ratio of the first two moments of Nl for G{N,M) at c = 3, the symbols are numerically determined values 
which converge in the large size limit to the solid line, analytically computed in App.^ Right: finite size analysis for ^ = 1/4, 
solid line is a best fit of the form a + b/N + c/N'^ , where a is constrained to its analytic value (dashed line), and the form of 
the fit is justified in Add. 1X1 




FIG. 10: The ratio of the first two moments of A/jv for graphs with an equal fraction of degree 3 and 4 vertices. Solid line is a 
best fit of the form a + h/N + c/iV^, with a ^ 0.002, as found in App.|Xl 



VIII. CONCLUSIONS AND PERSPECTIVES 



Let us summarize the main results presented in this paper. We have proposed an approximative counting algorithm 
that runs in a linear time with respects to the size of the graph. We also presented an heuristic method to compute the 
typical number of circuits in random graph ensembles, which yields a quantitative refinement of Wormald's conjecture 
on the typical number of Hamiltonian cycles in ensembles with minimal degree 3 (Eq. ISH)) and a new conjecture on 
the maximal length of circuits in ensembles with a small fraction of degree 2 vertices in their 2 cores (Eq. (|55|l 'l. 

Several directions are opened for future work. First of all we believe that a rigorous proof of Wormald's conjecture, 
which seems difficult to reach by variations around the second moment method, could be obtained by statistical 
mechanics inspired techniques. In recent years there has been indeed a series of mathematical achievements in 
the formalization of the kind of method used in this article. One line of research is based on Guerra's interpola- 
tion method and culminated in Talagrand's proof of the correctness of the Parisi free-energy formula for the 
Sherrington-Kirkpatrick model j^SlJ . These ideas have also been applied to sparse random graphs in [s^ l40l | . Alter- 
natively the local weak convergence method of Aldous '4l'| has been successfully applied to similar counting problems 
in random graphs ^4Zj . 

There has also been a recent interest in the corrections to the Bethe approximation for general graphical 

models. It would be of great interest to implement these refined approximations for the counting problem considered 
in this paper. This should lead on one hand to a more precise counting algorithm, and on the other hand give access 
to the finite-size corrections of the quenched entropy. We expect in particular that the difference between circuits and 
unions of vertex disjoint circuits will become relevant for these corrections. 

The convergence in probability of log Ml/ N expressed by Eq. H17|) can a priori be promoted to a stronger large 
deviation principle: according to the common wisdom, the finite deviations of this quantity from cTq are exponentially 
small. A general method for computing these rate functions has been presented in '46| and could be of use in the 
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present context. An interesting question could be to compute the exponentially small probability that a random graph 
is not Hamiltonian in ensembles where typical graphs are so. 

In the algorithmic perspective, one could try to take advantage of the local informations provided by the messages. 
In particular they could be useful to explicitly construct long cycles, in a "belief inspired decimation" fashion |47l| : 
most probable edges in the current probability law would be recursively forced to be present, and the BP equations 
re-runned in the new simplified model. 

The neighborhood of the percolation transition should also be investigated more carefully, in particular the effects 
of replica-symmetry breaking onto the structure of the configuration space. 

The case of heavy-tailed (scale-free) degree distributions deserves also further work. The assumption of fast decay 
we made here is indeed crucial for some of our results: Bianconi and Marsili showed in that scale-free graphs, 
even with a minimal connectivity of 3, can fail to have Hamiltonian cycles. Other random graph models (generated 
by a growing process or incorporating correlations between vertex degrees could also been investigated. 

Let us finally mention two closely related problems which are currently studied with very similar means. Circuits 
can be defined as a particular case of fc- regular graphs, with k = 2. Replacing the number of allowed edges around 
any site from 2 to in Eq. one can similarly study the number of fc- regular subgraphs in random graph ensemble. 
The case fe = 1 corresponds to matchings, which was la rgel y studied in the mathematical literature (50i.i51j and have 
been reconsidered by statistical mechanics methods in [sj- The appearance of fc > 3- regular subgraphs in random 
graphs was first considered in [s^, see [H^ for a statistical mechanics treatment. 
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APPENDIX A: COMBINATORIAL APPROACH 



1. Notation and definitions 



We collect in this appendix the combinatorial arguments for the computation of the first and second moment of 
the number of circuits in various random graph ensembles. Let us denote A/l(G') the number of circuits of length L 
in a graph G, and Cl the set of circuits of length L in the complete graph of N vertices, its cardinality being 

Indeed, choosing such a circuit amounts to select an ordered list of the L vertices it will visit, modulo the orientation 
and the starting point of the tour. Introducing 1{H; G) the indicator function equal to 1 if is a subgraph of G, 
otherwise, we can write 

Nl{G)^ ^ I(G;G) . (A2) 

Let us now describe the random graph ensembles we shall consider in the following. The first two are the classical 
Erdos-Renyi random graph ensembles. In G{N,p), each of the N{N — l)/2 edges is present with probability p, 
independently of the others. In G{N,M), a set of M distinct edges is chosen uniformly at random among the 
N{N — l)/2 possible ones. We shall concentrate on the thermodynamic limit N oo, p = c/N and M = cN/2 
with the mean connectivity c kept finite. In this regime G{N,p) and G{N,M) are essentially equivalent: drawing 
at random from G{N,p) amounts to draw M from a binomial distribution of parameters (p, N{N — l)/2), and then 
drawing at random a graph from G{N,M). In the limit described above, the number of edges in G{N,p) is weakly 
fluctuating around M = cN/2. Moreover the degree of a given vertex in the graph converges in both cases to a 
Poisson random variable of parameter c. 

For an arbitrary degree distribution g^. of mean c, one can define the uniform ensemble of graphs obeying this 
constraint of degree distribution. A practical way of drawing a graph from this ensemble is the so-called configuration 
model [b^, defined as follows. Each of the vertices is randomly attributed a degree, in such a way that Nqk vertices 
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have degree k (we obviously skip some technical details [23: Qk should be a function of N, such that Nqk is an integer). 
2k half-links goes out of each vertex of degree k. Then one generates a random matching of the cN = 2M half-links 
and puts an edge between sites which are matched. In general one obtains in this way a multigraph, i.e. there appear 
edges linking one vertex with itself, or multiple edges between the same pair of vertices. However, discarding the 
non-simple graphs leads to an uniform distribution over the simple ones To compute averages over the graph 
ensemble, one can thus use the configuration model and condition on the multigraph to be simple. For clarity we 
shall denote Af^ (G) the number of circuits in the unconditioned multigraph ensemble. Note also that regular random 
graphs are a particular case of this ensemble, with = 6k,c- 



1. First moment computations 



a. Generalities 



Taking the average over the graphs of Eq. l|A2p leads to 

A/HG)= MC^ = MlVl (A3) 

C€Cl 

for the ensembles we are considering, where the probability Vl = HC; G) for a circuit C G to be present is 
independent of C. Before inspecting the various cases, let us state the asymptotic behaviour of Al^ in the limit 
N,L 00, £ ~ L/N finite, obtained with the Stirling formula: 

Mm - ^ ^^J_^ ^^e^(-^^(^-^)-^)(l + 0(iV-^)) for < ^ < 1 , (A4) 

Mm = -^^N^e-^{l + OiN-')) . (A5) 
In the first formula we have introduced the function h(x) = xlnx. 



b. Erdos-Renyi ensembles 

In G{N,p) the probability Vl has a very simple expression, Vl = {c/N)^ ■ The mean number of circuits thus reads 

where the first expression is valid for any N,L, and the second one has been obtained in the thermodynamic limit 
with < ^ < 1 The annealed entropy for this first ensemble is: 

a{e) =-(l-^)ln(l-£)+^((lnc)-l) . (A7) 

Note that ii £ = 1, the algebraic prefactor in (|A6|) is slightly different, 

A^=-^y|e^«'-)-i)(l + 0(iV-i)) . (A8) 



In G{N, M) the probabihty Vl reads 

Obviously this expression has a meaning only for L < ill, as there cannot be circuits longer than the total number 
of edges. This gives an exact expression for Ml{G) = MlVl for any N and L < mm{N, M). The expansion in the 
thermodynamic limit with < ^ < min(l, c/2) leads to 

Nl{G) = ^=e^^(^)(l + O(iV-i)) . (AlO) 

^ 2£VT^l^l - f 
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with the annealed entropy 

a{e) =-(l-£)ln(l-^)+ 7) +^((lnc)-2) . (All) 

Again the different algebraic prefactor in (|A10|I can be easily computed also for £ — min(l, c/2). 

Let us now make a few comments on these results. First, when c < 1, both annealed entropies are negative for all 
values oi £ > where they are well defined. Consequently Mi^(G) is exponentially small in the thermodynamic limit, 
and thanks to the so-called Markov inequality (or first moment method) valid for positive integer random variables, 

Vroh[ML (G) > 0] < AUG) , (A12) 

with high probability there are no circuits of extensive lengths in these graphs. This could be expected: the percolation 
transition occurs at c = 1, in this non percolated regime the size of the largest component is of order In A^, and thus 
extensive circuits cannot be present. 

As a second remark, let us note that for c > 1, the annealed entropy of the first ensemble is strictly positive for 
t G]0,^a(c)[, with £a(c) an increasing function which reaches the value 1 in c = e. The average number of such 
circuits is in consequence exponentially large. However one can easily convince oneself that this cannot be the typical 
behaviour. Indeed, it turns out that £a(c) is larger than the typical number of vertices in the 2-core of the graphs 
4ore(c). When £ belongs to the interval ]4orc(c), £a(c)[, typically the graphs cannot contain circuits of L = £N edges, 
however an exponentially small fraction of the graphs have 2-core larger than their typical sizes. These untypical 
graphs contribute with an exponential number of circuits to the annealed mean A/l(G), which is in consequence not 
representative of the typical behaviour of the ensemble. 

Finally, let us underline that the annealed entropies (|A7IA7|I for the two ensembles are definitely different. For 
instance, in the second ensemble, the entropy is defined only for £ < c/2: the number of edges in the graph being 
fixed at M = Nc/2, no circuits can be longer than the number of edges. On the contrary, in the first ensemble, 
arbitrary large deviations of the number of edges from its typical value are possible, even if with an exponential small 
probability. 



c. Arbitrary connectivity distribution and regular 



The expectation of the number of circuits of length L in the multigraph ensemble extracted with the configuration 
model was presented in 19] . For the sake of completeness and to make the study of the second moment simpler we 
reproduce the argument here. In this case one has 



Vl 




{cN -2L-iy. 
(cA^- 1)!! 



(A13) 



where the sum is over L2, L^, . . . positive integers constrained by X]fc=2 — L, and we used the classical notation 
{2p— 1)!! = {2p— l)(2p — 3) ... 1. The L^'s are the number of sites of degree k in the circuit, which are to be distributed 
among the Nq^ sites of degree k. The term {k{k — 1))^'' accounts for the choice of the half links around each site, 
and finally the ratio of the double factorials is the probability that the matching of half-links contains the desired 
configuration. Introducing the integral representation of the Kronecker symbol, (5„ ~ f{d6/2i7r)6^'"^^, where is a 
complex variable integrated along a closed path around the origin, this expression can be simplified in 



Vl 



1 {cN-2L-l)\\ 



k=2 



Nqk 



(A14) 



In the thermodynamic limit the integral can be evaluated by the saddle-point method, combining the expansion with 
the one oi Ml yields 



A/£(G) 
a{£) 



1 



h{c - 2£) - -h{c) + h{£) + ext 



.k=2 



qk\n{l + k{k 1)9) ~ £hie 



(A15) 
(A16) 



where here and in the following = stands for equivalence upto subexponential terms, i.e. xn = un means 
(1/A^) \og{xNlyN) ^ as A^ ^ 00. 
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In the regular case one has 



from which the prefactors are more easily computed 

= l__i==e~-W(l + O(iV-i)) for < £ < 1 , (A18) 

77^ = _^y|e^-(i)(l + 0(A^-i)) , (A19) 

a{e) = -h{l-£)-^h{c) + ^h{c-2£)+ein{c{c-l)) . (A20) 

Moreover the conditioning on the multigraph being simple can be explicitly done in the regular case @, thanks to 
the relative concentration of Ml (G) . This yields 



A/'l(G) 

■ exp 



{c-2)e / , cii-e) 



2£ \ c-2£ 



for < ^ < 1 . (A21) 



AfliG) 

We checked that the numerical findings of were in perfect agreement with these exact results. 

Note that in the regular case, this conditioning modifies the value of A/'£(G) only by a constant factor, thus the 
annealed entropy is the same in the graph and in the multigraph ensemble. It is not clear to us whether this fact 
should remain true for arbitrary connectivity distributions. 

3. Second moment computations 

a. Generalities 

We now turn to the computation of the second moment of the number of circuits, which has been inspired by |^. 
Taking the square of Eq. IHI) and averaging over the ensemble leads to 

MUG) - KCi;G)I{C2;G) = 7Vl(G)+ KCi;G)I{C2;G) (A22) 

= AUG)+ Y MlxyzVlxyz ■ (A23) 

X.Y,Z 

We have indeed isolated the term Ci = G2 in the sum, which is readily computed, from the off-diagonal terms. The 
last expression is more easily understood after having a look at Fig. ^] where we sketched the shape of the union of 
two distinct circuits. This pattern is characterized by X , the number of common paths shared by Ci and C2, Y , the 
number of edges in these paths, and Z, the number of vertices which belongs to both circuits but are not neighbored 
by any common edge. One finds 2A vertices at the extremities of the common paths, Y — X vertices in the interior 
of the common paths, hence X ~\-Y + Z vertices belong to both circuits, N — 2L + X + Y + Z to none of them. In 
consequence the sum is over X, Y, Z non-negative integers subject to the constraints: 

2L-N<X + Y + Z<L, X <Y , X = 0^Y = . (A24) 

■Mlxyz is the number of pairs of distinct circuits of the complete graph whose union has the characteristics {X, Y, Z), 
and Vlxyz is the (ensemble- dependent) probability that such pattern appears in a random graph. Let us show that 

Mlxyz X\ z\ {{L- X -Y - Z)\f (A^ - 2L + X + y + Z)! - 1 j ' ^ ^' 

where the combinatorial factor is by convention set to 1. To construct such a pattern, one has to choose among 
the A vertices those which are in Ci but not in C2 , in C2 but not in Ci (both of these categories contain L — X — Y — Z 
vertices), those in the common paths (X -|- Y) and those shared by the circuits but with no adjacent common edges 
{Z). This can be done in 

A'! 

(A26) 



Z\ {X + y)! {{L-X -Y - Z)\Y (A - 2L + A + y + Z) 
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FIG. 11: The union of two circuits. The vertices of Ci are on and above the horizontal central line, those of C2 on and below. 
In this drawing L — 13, X = 3, Y = 6, Z — 1, ni — n2 — = 1. 



distinct ways. 

Let us call rii the number of common paths of i edges, for 1 < i < i — 1, which obey the constraints rii = X 
and iui ~ Y. The X + Y sites can be distributed into such an unordered set of unorientated paths in 

(A27) 

distinct ways. We have to sum this expression on the values of Ui satisfying the above constraints. By picking up the 
coefficient of in 



one finds that 



^ HI^^'^^-S.-^^.E.- = ^(^-0 ■ ^^^^^ 

ni,...,Ti£,_i ^ ^ 

When X — Y — this factor should be one, in agreement with the above convention. 

Finally Ci is formed by choosing an ordered list of the L — X — Y — Z vertices which belongs only to it, the Z isolated 
common vertices, and of the X orientated common paths, modulo the starting point and the global orientation of 
this tour, hence a factor 

^^4^2- , (A30) 

and the same arises when constructing C2. Eq. (|A25|I is obtained by multiplying the various contributions. 
In the thermodynamic limit with (x, y, z) — {X/N, Y/N, Z/N) kept finite, Stirling formula yields 

Mlxyz = N^^^'-y^eMNm{l,x,y,z)] , (A31) 
m(/, x,y, z) = y — 2£ + X In 2 — 2h{x) + h{y) — h{z) — h{y — x) 

+ 2h{i ~y)- 2h{l -x-y-z)-h{l-2l + x + y + z) . (A32) 



b. Erdos-Renyi ensembles 



For both G{N,p) and G{N, M) ensembles, the probability Vlxyz depends only on the number of edges present in 
the union of the two circuits, Vlxyz = 'P2L-Y ■ For the non trivial range of parameters I, c where the first moment 
is exponentially large, the first term in Eq. ljA23|) can be neglected. The sum over {X, Y, Z) can be evaluated with 
the saddle-point method, yielding 



AfliG) = exp[iVr(^)] , r(^) = max[p(2^ - y) + m(£, y)] , 



(A33) 
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where 

' \^h{c)~^hic-2e)~e forG(iV,M) ' ^ ' 

and we introduced 

m{l,y) — max m(^, a;, y, z) (A35) 

= -2h{l - £) + y - 2£ + h{y) 

+max [x In 2 - 2ft,(a;) + /i(l - a; - - - x) + 2h{l - y) - 2h{l ~ x - y)] . (A36) 



The range of parameters in the various optimizations are such that 2£—l < x + y + z < t. The step between Eqs. (|A35|) 
and (|A36|) amounts to maximize m over z, which can be done analytically. It is then very easy to determine the 
function rh{£,y) numerically. Finally, defining S{£) — t{£) — 2(j{£)^ we determined numerically this function (see 
Fig. O and found that S* > for all parameters such that ct > 0: the second moment oi Ml{G) is then exponentially 
larger than the square of the first moment, which forbids the use of the second moment method to determine the 
typical value of Nl ■ 



c. Arbitrary connectivity distribution and regular 

The computation oiVLXYZ in the configuration model can be done similarly to the one of Pl (cf. Eq. (|A13p '). To 
simplify notations let us define U = 2L — 3X — Y — 2Z, V = 2X, and the multinomial coefficient 



N \ N\ 



^U,V,ZJ U\V\Z\{N -U -V - Z)\ ' 
for U + V + Z < N. We also use (fc)„ — k{k — 1) . . . (fc — n + 1). With these conventions one finds 



(A37) 



Indeed, U (resp. V, Z) is the number of vertices with two (resp. three, four) half-edges involved in the pattern, and 
(Uk, Vfe, Z^,) the number of such vertices among the ones of degree k. In consequence the sum is over non-negative 
integers with V2 = Z2 = Z3 = 0, Uk + Vk + Zk < Nk, and X^fc Uk = U, J2k = V, J2k = Z. These last three 
constraints can be implemented using the complex integral representation of Kronecker's delta, themselves evaluated 
by the saddle point method in the thermodynamic limit: 

Vlxyz = iV^-2^exp[iVp(^,a;,y,z)] , (A39) 

p{i, X, y, z) = ^h{c -U + 2y) - ]^h{c) + 2£ - y + h{2£ - 3x - y - 2z) + h{2x) + h[z) + h{l - 2£ + x + y + z) 



ext 

^1 ,^2 ,^3 



^ qk ln(l + (A:)20i + (fc)302 + (fc)4^3) ~ {2£ ~ 'ix - y - 2x) In 9^ - 2a; In 63 - z In ^3 

.fe=2 



Once this function has been determined for a given degree distribution, the exponential order of A/'£^(G) can be 
computed as 



Nl\G) = exp[7VTW] , r(£) = max[p(^, x, y, z) + m(£, x, y, z)\ , (A40) 

x,y,z 



where m is given in Eq. ljA32|) . 

In the regular case, the maximization over the 6 parameters can be performed analytically, and yields t(€) = 
2a(t) 0, proving the concentration (at the exponential order) of Ml(G) around its mean. We expect that for any 
(fastly decaying) connectivity distribution not strictly concentrated on a single integer, t{£) > 2a{£) when a{£) > 0. 
A proof of this conjecture would be a quite painful exercise in analysis that we did not undertake. We however verified 
numerically this statement for the Hamiltonian circuits of random graphs with an equal mixture of vertices of degree 
3 and 4, yielding t(1) - 2cr(l) « 0.002. 
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We have been rather loose in treating the algebraic prefactor hidden in = for the various expressions of M]^{G). 
However it is rather simple to determine the power of N in this prefactor, collecting the contributions which arise from 
the Stirling expansions, the transformation of sums into integrals, and the evaluation of the latter with the saddle 
point method. This leads to 



•^'^'^l = est (1 + 0{N-^)) exp[iV(r(^) - 2a{l))] , (A41) 

as we observed numerically in Sec. IVIII 

Note also that some informations on the structure of the space of configurations can be obtained from this kind 
of computations. The average number of pairs of circuits at a given "overlap" (number of common edges) is indeed 
obtained from the second moment computations if the parameter y is kept fixed. 



4. On the union of vertex disjoint circuits 



In the statistical mechanics treatment of the main part of the text we used a model which counts the number A/'^(G) 
of subgraphs of G made of the union of vertex disjoint circuits of total length L. We want to show in this appendix 
that, at the leading exponential order, the average of J\f'^{G) equals the one of Ml{G) in the various ensembles 
considered in this appendix. Let us denote the set of subgraphs of the complete graph on N vertices made of 
unions of vertex disjoint circuits of total length L, and Ai'^ its cardinality. As such subgraphs are still made of L edges 
connecting L vertices, A/'^(G) = A4'j^Vl, where the probability Vl is the one defined previously for the computation 
of A/l(G'). Let us define M'^ ^ the cardinality of the subset of where the subgraphs are made of A disjoint circuits. 
A short reasoning leads to 

^^.^-(iV^F ^ \,,A. Sz.a.,aSj:^.a„l, (A42) 

where the integers Ai are the number of circuits of length i in the subgraph. From this expression it is easy to check 
that M'l I = Ml, and that as long as A is finite in the thermodynamic limit, A^'^ ^ = Ml- More precisely, one 
can show that the leading behaviour of TW'^ = J2a-^'l a is not modified by contributions with A growing with N. 
Indeed, 

M'r. 



2L [t^] exp 









[I 




■4)] 



(A43) 



Ml 

where [t^]f{t) denotes the coefficient of order x in the series expansion of f{t). Evaluating the right hand side with the 
saddle point method when L oo, one can conclude that M'j^ = Ml and from the above remark M'i^{G) = Nl{G). 
As far as annealed computations are concerned, the distinction between circuits of (extensive) length L and union of 
disjoint circuits of total length L does not modify the entropy. The hypothesis made in the main part of the text is 
that this remains true for the quenched computations. 



APPENDIX B: ANALYSIS OF THE LEAF REMOVAL ALGORITHM ON RANDOM GRAPHS WITH 

ARBITRARY CONNECTIVITY DISTRIBUTION 

We want to justify in this appendix the geometric interpretation of the null messages elimination we gave in 
Sec, nil 51 Consider a random graph drawn uniformly among the ones with the connectivity distribution qj.. The 
2-core of a graph is the largest subgraph in which all vertices have connectivity at least two. It can be determined 
using the following leaf removal algorithm, which reduces iteratively the graph. At each time step, if there is at least 
one vertex of degree I, choose randomly one of them, and remove the unique edge to which it belongs. When there is 
no vertex of degree 1, the algorithm stops. At this point either all the edges have been removed and one is left with 
N isolated vertices, or there remains some isolated vertices and a subgraph in which all vertices have at least degree 
2, i.e. the 2-core of the initial graph. 

One can define more generally the g-core of a graph as the largest subgraph with minimal degree q. For Erdos-Renyi 
random graphs, the thresholds for the appearance of giant g-cores have been obtained in |55j . These results have been 
recently extended to random graphs with arbitrary connectivity distributions in [s^ , this appendix can thus be viewed 
as an informal presentation of these mathematical works, with the emphasis put on the quantitative results instead 
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of the mathematical rigor (see also [57|, jSg for an heuristic derivation in the arbitrary connectivity distributions case, 
and [S^ IHpJ for new mathematical treatments of the problem) . In the following we shall study the behaviour of the 
leaf removal algorithm through differential equations for the evolution of the average connectivity distribution along 
the execution of the leaf removal. This method is widely used in mathematics and computer science, see in particular 
[sif for a general presentation, and a detailed derivation of the equations l|B3l) . 

We shall denote T the number of steps (elimination of one edge) already performed by the algorithm, and t = T/N 
the reduced time variable. Let us call Rk{t) = Nrk{t) the average (over the choice of the initial graph and the random 
decisions taken by the algorithm) number of sites of connectivity k in the residual graph obtained after Nt time steps 
of the algorithm. The initial condition reads obviously rk{t ~ 0) ~ qk- If one calls ffe(t) the probability that the 
neighbor of the selected degree 1 vertex has connectivity fc + 1, the average evolution of the R's during the time-step 
t-^t + l/N reads 

OO 

Rk{t + l/N) - Rk{t) = Sk^o - Sk.i + Yl ^fe' (t)[-Sk^k'+i + Sk,k'] ■ (Bl) 

fe'=0 

To close this set of equations we have to express the offspring probabilities Vk (t) in terms of the connectivity distribution 
Tfc (t) . As the graph is sequentially exposed by the algorithm, the residual graph at time t is still uniformly distributed 
according to the connectivity distribution rk{t), hence 

~ (fc + l)rfc+i(t) (fc+l)rfc+i(t) 

"^'^'^^ Ekkrkit) = c-2t ■ 

In the last equality c is the initial mean connectivity ^f^kq^, which is reduced by 2/N at each time-step. In the 
thermodynamic limit the discrete time relations (jBljl become ordinary differential equations. 



ri(t) 



hit) = -1 



crjit)"^ ' 

ri{t) ^ 2r2{t) 
cri{t)^ cri{t)'^ 



■ (i\ fcrfc(0 , {k + l)rk+i{t) . , . „ 

'^fc'-^) " 77x2+ 77v2 forfc>2. (B3) 



where dotted quantities are derivatives with respects to time, and we introduced the notation rj(t) = y 1 — 

For simplicity let us first assume the existence of a cutoff fcm in the original distribution qk, qk = for k > fci„. 
As the leaf removal procedure never increases the connectivity of one site, this cutoff remains present in rk{t) for all 
times. The equations of rank fcm in the hierarchy (|B3|I is then closed on rk^. Using the fact that r]{t) = —l/{cri{t)), 
it can be written as 

r;c„(t) = -fcm^n.„,(t) , (B4) 
and easily integrated with the initial condition rk„^ (t = 0) — qk„^ as 

• (B5) 
Now one can prove by a decreasing recurrence on k from fc„i down to 2 that 

k 

.k 



^fcW = E9nUhw'(i-'?wr^' (B6) 



solves the hierarchy of equations ljB3|) . Note also that the initial conditions rfc(O) — qk are enforced by Eq. (jB6p as 
?7(0) = 1. Once rk{t) has been computed for fc > 2, the equation on ri yields 

ri(t)=-cr;(t)(l-r;(t)) + Enq„r?(t)(l-,?(t))"-i . (B7) 



Finally 7'o(i) can be obtained from the normalization condition of the r^'s. 
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We introduced the cutoff fcm to have an explicit starting point of the downwards recurrence on k. However, the 
expression HB6|I formaUy solves the hierarchy of equations ()ij3|l even for unbounded distributions , we shall therefore 
send the cutoff to infinity from now on, assuming that all the sums remain convergent. 

The 2-core is found when the leaf-removal algorithm stops, at the smallest time for which the number of degree 1 
vertices vanishes, ri{t^) — 0. This equation always admits — c/2 as a solution (the graph has then been emptied), 
however if there is a smaller solution the 2-corc is non trivial and the algorithm stops before having removed all edges. 
Calling f] = one obtains if 77 > : 

00 , 00 

k=l k=0 

which is nothing but Eq. (|25l) on the fraction of non vanishing messages obtained in the main part of the text. The 
confirmation of the interpretation given in Sec. IIII 51 follows easily: 

• the number of edges in the 2-core is equal to the initial number of edges minus the number of steps performed 
by the algorithm before stopping, Mcoro = M — Nt^, = Mrj^ . 

• the distribution of the connectivities of the sites in the 2-core is rk{t^), as expected from Eq. (|27ll . 

For completeness we also give the number of sites in the 2-core: 

00 00 

iVcorc = A^4orc , tcor^ ^^rk{U) = I qk{l - f^f - Cr^{l ~ T^) . (B9) 

fe=2 fe=0 

As it should, this number is smaller than the size of the giant component, which reads [2^l30l|: 

iVgiant^iV (^1-^9^(1 -'7)'^ ■ (BIO) 

Moreover the fraction of sites which are in the giant component but out of the 2-core is proportional to 77(1 — 77). 
Indeed, the corresponding edges bear exactly one non null directed message, in the formalism of Sec. liii fj| if both 
messages were non null the edge would be in the 2-core, if both vanished the edge would be out of the giant component. 



APPENDIX C: AN ALTERNATIVE DERIVATION OF l^^ = 1 WHEN THE MINIMAL CONNECTIVITY 

IS 3 

This appendix presents, as a consistency check, another derivation of the identity ^max = 1 for random graph 
ensemble with minimal connectivity of 3. In the main part of the text fSec. lVfl . we obtained it by inspection on the 
behaviour of the free-energy in the large u limit, because of the absence of hard fields. There exists however another 
expression of ^max, in terms of the distribution of evanescent fields, obtained by taking the large u limit in Eq. H23() : 

/•oo 

Cax = 7: / dxiVo{xi)dx2Vo{x2)— . (CI) 

2 Jo 1 + X1X2 

Let us introduce the following functional of any probability distribution law A, 

poo 

Fk[A]{x) = dxi A{xi) . . .dxk A{xk) S{x ~ hk{xi, . . . ,Xk)) , (C2) 
Jo 

such that Eq. H52I) can be rewritten in a compact way as Vq — X]fc9fc-^fc[^]i ^^'^ ^ bilinear form on the space of 
probability distribution functions, 

/•oo 

{A,B)^ dxdyA{x)B{y)-^ . (C3) 
Jo l + xy 



Consider now this form with its arguments being a distribution A and its image through the functional Fy 



k- 



{A,Fk[A])^l dxoA{xo)dx^A{x,)...dxkA{xk) ^ x,hu{x, xu) ^^^^ 
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The rational fraction in the integral can be transformed in the following way: 



fc fc 

XQhk[xi, . . . ,Xk) _ i=i _ i=i ^^^^ 



xo}_^Xt+ XiX-j o<i<]<k 

i=l l<i<j<k 

Both the denominator of this fraction and the integration measure Jli=o ^*^* ^i^i) being invariant under the permuta- 
tions of the k + 1 Xi's, the integral can be computed by symmetrizing the numerator of the fraction. The normalization 
of A then gives 

{A,Fk[A]) = ^. (C6) 

The proof of £max = 1 is now straightforward: 

oo ~ 

4.ax = ^{Vo, Vo) = E ?<^0' ^fct^o]) • (C7) 
fc=2 

Using the identity (|C6|) and the relation between q and q (cf. Eq. p8|) '). ^max is found to be the sum of qk for fc > 3, 
and hence is equal to 1 by normalization. 

We also verified numerically that in presence of degree 2 vertices, and hence of non trivial hard fields, the limit of 
Eq. (|23|l which involves both evanescent and hard fields, coincide with the expression Eq. I|58|l in terms of hard fields 
only. We believe this could be proved analytically, yet we did not find a simple way to do it. 
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